Overview
Brought to you by YData
Dataset statistics
| Number of variables | 32 |
|---|---|
| Number of observations | 119190 |
| Missing cells | 129208 |
| Missing cells (%) | 3.4% |
| Duplicate rows | 8154 |
| Duplicate rows (%) | 6.8% |
| Total size in memory | 29.1 MiB |
| Average record size in memory | 256.0 B |
Variable types
| Categorical | 16 |
|---|---|
| Numeric | 14 |
| Text | 1 |
| DateTime | 1 |
| Dataset has 8154 (6.8%) duplicate rows | Duplicates |
agent is highly overall correlated with hotel | High correlation |
arrival_date_month is highly overall correlated with arrival_date_week_number | High correlation |
arrival_date_week_number is highly overall correlated with arrival_date_month | High correlation |
assigned_room_type is highly overall correlated with reserved_room_type | High correlation |
distribution_channel is highly overall correlated with market_segment | High correlation |
hotel is highly overall correlated with agent | High correlation |
is_canceled is highly overall correlated with reservation_status | High correlation |
market_segment is highly overall correlated with distribution_channel | High correlation |
reservation_status is highly overall correlated with is_canceled | High correlation |
reserved_room_type is highly overall correlated with assigned_room_type | High correlation |
children is highly imbalanced (80.7%) | Imbalance |
babies is highly imbalanced (97.2%) | Imbalance |
meal is highly imbalanced (53.5%) | Imbalance |
distribution_channel is highly imbalanced (63.2%) | Imbalance |
is_repeated_guest is highly imbalanced (79.6%) | Imbalance |
reserved_room_type is highly imbalanced (58.3%) | Imbalance |
assigned_room_type is highly imbalanced (51.4%) | Imbalance |
deposit_type is highly imbalanced (65.3%) | Imbalance |
customer_type is highly imbalanced (50.6%) | Imbalance |
required_car_parking_spaces is highly imbalanced (85.4%) | Imbalance |
agent has 16315 (13.7%) missing values | Missing |
company has 112402 (94.3%) missing values | Missing |
previous_cancellations is highly skewed (γ1 = 24.44031598) | Skewed |
previous_bookings_not_canceled is highly skewed (γ1 = 23.5438383) | Skewed |
lead_time has 6338 (5.3%) zeros | Zeros |
stays_in_weekend_nights has 51904 (43.5%) zeros | Zeros |
stays_in_week_nights has 7635 (6.4%) zeros | Zeros |
previous_cancellations has 112713 (94.6%) zeros | Zeros |
previous_bookings_not_canceled has 115573 (97.0%) zeros | Zeros |
booking_changes has 101147 (84.9%) zeros | Zeros |
days_in_waiting_list has 115501 (96.9%) zeros | Zeros |
adr has 1953 (1.6%) zeros | Zeros |
total_of_special_requests has 70203 (58.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-01-03 21:51:08.838921 |
|---|---|
| Analysis finished | 2025-01-03 21:52:27.910629 |
| Duration | 1 minute and 19.07 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
hotel
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| City Hotel | |
|---|---|
| Resort Hotel |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 10.671029 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | City Hotel |
|---|---|
| 2nd row | City Hotel |
| 3rd row | City Hotel |
| 4th row | Resort Hotel |
| 5th row | Resort Hotel |
Common Values
| Value | Count | Frequency (%) |
| City Hotel | 79200 | |
| Resort Hotel | 39990 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| hotel | 119190 | |
| city | 79200 | |
| resort | 39990 | 16.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 238380 | |
| o | 159180 | |
| e | 159180 | |
| 119190 | ||
| H | 119190 | |
| l | 119190 | |
| C | 79200 | 6.2% |
| i | 79200 | 6.2% |
| y | 79200 | 6.2% |
| R | 39990 | 3.1% |
| Other values (2) | 79980 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1271880 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 238380 | |
| o | 159180 | |
| e | 159180 | |
| 119190 | ||
| H | 119190 | |
| l | 119190 | |
| C | 79200 | 6.2% |
| i | 79200 | 6.2% |
| y | 79200 | 6.2% |
| R | 39990 | 3.1% |
| Other values (2) | 79980 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1271880 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 238380 | |
| o | 159180 | |
| e | 159180 | |
| 119190 | ||
| H | 119190 | |
| l | 119190 | |
| C | 79200 | 6.2% |
| i | 79200 | 6.2% |
| y | 79200 | 6.2% |
| R | 39990 | 3.1% |
| Other values (2) | 79980 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1271880 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 238380 | |
| o | 159180 | |
| e | 159180 | |
| 119190 | ||
| H | 119190 | |
| l | 119190 | |
| C | 79200 | 6.2% |
| i | 79200 | 6.2% |
| y | 79200 | 6.2% |
| R | 39990 | 3.1% |
| Other values (2) | 79980 | 6.3% |
is_canceled
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 75039 | |
| 1 | 44151 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 75039 | |
| 1 | 44151 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 75039 | |
| 1 | 44151 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 75039 | |
| 1 | 44151 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 75039 | |
| 1 | 44151 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 75039 | |
| 1 | 44151 |
lead_time
Real number (ℝ)
Zeros 
| Distinct | 479 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 104.01917 |
| Minimum | 0 |
|---|---|
| Maximum | 737 |
| Zeros | 6338 |
| Zeros (%) | 5.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 18 |
| median | 69 |
| Q3 | 160 |
| 95-th percentile | 320 |
| Maximum | 737 |
| Range | 737 |
| Interquartile range (IQR) | 142 |
Descriptive statistics
| Standard deviation | 106.87159 |
|---|---|
| Coefficient of variation (CV) | 1.027422 |
| Kurtosis | 1.6969412 |
| Mean | 104.01917 |
| Median Absolute Deviation (MAD) | 60 |
| Skewness | 1.3465801 |
| Sum | 12398045 |
| Variance | 11421.536 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6338 | 5.3% |
| 1 | 3456 | 2.9% |
| 2 | 2065 | 1.7% |
| 3 | 1814 | 1.5% |
| 4 | 1712 | 1.4% |
| 5 | 1563 | 1.3% |
| 6 | 1439 | 1.2% |
| 7 | 1330 | 1.1% |
| 8 | 1137 | 1.0% |
| 12 | 1079 | 0.9% |
| Other values (469) | 97257 |
| Value | Count | Frequency (%) |
| 0 | 6338 | |
| 1 | 3456 | |
| 2 | 2065 | 1.7% |
| 3 | 1814 | 1.5% |
| 4 | 1712 | 1.4% |
| 5 | 1563 | 1.3% |
| 6 | 1439 | 1.2% |
| 7 | 1330 | 1.1% |
| 8 | 1137 | 1.0% |
| 9 | 989 | 0.8% |
| Value | Count | Frequency (%) |
| 737 | 1 | < 0.1% |
| 709 | 1 | < 0.1% |
| 629 | 17 | |
| 626 | 30 | |
| 622 | 17 | |
| 615 | 17 | |
| 608 | 17 | |
| 605 | 30 | |
| 601 | 17 | |
| 594 | 17 |
arrival_date_year
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| 2016 | |
|---|---|
| 2017 | |
| 2015 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2016 |
| 3rd row | 2016 |
| 4th row | 2016 |
| 5th row | 2015 |
Common Values
| Value | Count | Frequency (%) |
| 2016 | 56609 | |
| 2017 | 40612 | |
| 2015 | 21969 | 18.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2016 | 56609 | |
| 2017 | 40612 | |
| 2015 | 21969 | 18.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 119190 | |
| 0 | 119190 | |
| 1 | 119190 | |
| 6 | 56609 | |
| 7 | 40612 | 8.5% |
| 5 | 21969 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 476760 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 119190 | |
| 0 | 119190 | |
| 1 | 119190 | |
| 6 | 56609 | |
| 7 | 40612 | 8.5% |
| 5 | 21969 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 476760 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 119190 | |
| 0 | 119190 | |
| 1 | 119190 | |
| 6 | 56609 | |
| 7 | 40612 | 8.5% |
| 5 | 21969 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 476760 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 119190 | |
| 0 | 119190 | |
| 1 | 119190 | |
| 6 | 56609 | |
| 7 | 40612 | 8.5% |
| 5 | 21969 | 4.6% |
arrival_date_month
Categorical
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| August | |
|---|---|
| July | |
| May | |
| October | |
| April | |
| Other values (7) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 5.9034818 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | September |
|---|---|
| 2nd row | September |
| 3rd row | March |
| 4th row | April |
| 5th row | August |
Common Values
| Value | Count | Frequency (%) |
| August | 13856 | |
| July | 12642 | |
| May | 11764 | |
| October | 11144 | |
| April | 11070 | |
| June | 10919 | |
| September | 10495 | |
| March | 9775 | |
| February | 8056 | |
| November | 6775 | |
| Other values (2) | 12694 |
Length
| Value | Count | Frequency (%) |
| august | 13856 | |
| july | 12642 | |
| may | 11764 | |
| october | 11144 | |
| april | 11070 | |
| june | 10919 | |
| september | 10495 | |
| march | 9775 | |
| february | 8056 | |
| november | 6775 | |
| Other values (2) | 12694 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 95464 | |
| r | 78065 | 11.1% |
| u | 65253 | 9.3% |
| b | 43240 | 6.1% |
| a | 41443 | 5.9% |
| y | 38386 | 5.5% |
| t | 35495 | 5.0% |
| J | 29485 | 4.2% |
| c | 27689 | 3.9% |
| A | 24926 | 3.5% |
| Other values (16) | 224190 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 703636 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 95464 | |
| r | 78065 | 11.1% |
| u | 65253 | 9.3% |
| b | 43240 | 6.1% |
| a | 41443 | 5.9% |
| y | 38386 | 5.5% |
| t | 35495 | 5.0% |
| J | 29485 | 4.2% |
| c | 27689 | 3.9% |
| A | 24926 | 3.5% |
| Other values (16) | 224190 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 703636 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 95464 | |
| r | 78065 | 11.1% |
| u | 65253 | 9.3% |
| b | 43240 | 6.1% |
| a | 41443 | 5.9% |
| y | 38386 | 5.5% |
| t | 35495 | 5.0% |
| J | 29485 | 4.2% |
| c | 27689 | 3.9% |
| A | 24926 | 3.5% |
| Other values (16) | 224190 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 703636 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 95464 | |
| r | 78065 | 11.1% |
| u | 65253 | 9.3% |
| b | 43240 | 6.1% |
| a | 41443 | 5.9% |
| y | 38386 | 5.5% |
| t | 35495 | 5.0% |
| J | 29485 | 4.2% |
| c | 27689 | 3.9% |
| A | 24926 | 3.5% |
| Other values (16) | 224190 |
arrival_date_week_number
Real number (ℝ)
High correlation 
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.165031 |
| Minimum | 1 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 16 |
| median | 28 |
| Q3 | 38 |
| 95-th percentile | 49 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 13.605704 |
|---|---|
| Coefficient of variation (CV) | 0.50085363 |
| Kurtosis | -0.98601456 |
| Mean | 27.165031 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -0.010236308 |
| Sum | 3237800 |
| Variance | 185.11519 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33 | 3574 | 3.0% |
| 30 | 3082 | 2.6% |
| 32 | 3038 | 2.5% |
| 34 | 3035 | 2.5% |
| 18 | 2920 | 2.4% |
| 28 | 2850 | 2.4% |
| 21 | 2847 | 2.4% |
| 17 | 2799 | 2.3% |
| 20 | 2776 | 2.3% |
| 29 | 2759 | 2.3% |
| Other values (43) | 89510 |
| Value | Count | Frequency (%) |
| 1 | 1047 | |
| 2 | 1218 | |
| 3 | 1317 | |
| 4 | 1484 | |
| 5 | 1387 | |
| 6 | 1505 | |
| 7 | 2104 | |
| 8 | 2213 | |
| 9 | 2115 | |
| 10 | 2143 |
| Value | Count | Frequency (%) |
| 53 | 1815 | |
| 52 | 1192 | |
| 51 | 932 | |
| 50 | 1500 | |
| 49 | 1782 | |
| 48 | 1499 | |
| 47 | 1680 | |
| 46 | 1570 | |
| 45 | 1936 | |
| 44 | 2272 |
arrival_date_day_of_month
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.799723 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.781984 |
|---|---|
| Coefficient of variation (CV) | 0.55583151 |
| Kurtosis | -1.1874011 |
| Mean | 15.799723 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.0022192577 |
| Sum | 1883169 |
| Variance | 77.123243 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17 | 4401 | 3.7% |
| 5 | 4310 | 3.6% |
| 15 | 4188 | 3.5% |
| 25 | 4155 | 3.5% |
| 26 | 4143 | 3.5% |
| 9 | 4089 | 3.4% |
| 12 | 4084 | 3.4% |
| 16 | 4071 | 3.4% |
| 2 | 4051 | 3.4% |
| 19 | 4041 | 3.4% |
| Other values (21) | 77657 |
| Value | Count | Frequency (%) |
| 1 | 3620 | |
| 2 | 4051 | |
| 3 | 3852 | |
| 4 | 3754 | |
| 5 | 4310 | |
| 6 | 3825 | |
| 7 | 3656 | |
| 8 | 3915 | |
| 9 | 4089 | |
| 10 | 3565 |
| Value | Count | Frequency (%) |
| 31 | 2205 | |
| 30 | 3853 | |
| 29 | 3574 | |
| 28 | 3943 | |
| 27 | 3794 | |
| 26 | 4143 | |
| 25 | 4155 | |
| 24 | 3985 | |
| 23 | 3611 | |
| 22 | 3591 |
stays_in_weekend_nights
Real number (ℝ)
Zeros 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.92772045 |
| Minimum | 0 |
|---|---|
| Maximum | 19 |
| Zeros | 51904 |
| Zeros (%) | 43.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.9986286 |
|---|---|
| Coefficient of variation (CV) | 1.0764327 |
| Kurtosis | 7.1807193 |
| Mean | 0.92772045 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.3801488 |
| Sum | 110575 |
| Variance | 0.99725907 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 51904 | |
| 2 | 33259 | |
| 1 | 30574 | |
| 4 | 1852 | 1.6% |
| 3 | 1258 | 1.1% |
| 6 | 152 | 0.1% |
| 5 | 79 | 0.1% |
| 8 | 60 | 0.1% |
| 7 | 19 | < 0.1% |
| 9 | 11 | < 0.1% |
| Other values (7) | 22 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 51904 | |
| 1 | 30574 | |
| 2 | 33259 | |
| 3 | 1258 | 1.1% |
| 4 | 1852 | 1.6% |
| 5 | 79 | 0.1% |
| 6 | 152 | 0.1% |
| 7 | 19 | < 0.1% |
| 8 | 60 | 0.1% |
| 9 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 19 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 16 | 3 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 3 | < 0.1% |
| 12 | 5 | < 0.1% |
| 10 | 7 | < 0.1% |
| 9 | 11 | < 0.1% |
| 8 | 60 | |
| 7 | 19 | < 0.1% |
stays_in_week_nights
Real number (ℝ)
Zeros 
| Distinct | 35 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5004363 |
| Minimum | 0 |
|---|---|
| Maximum | 50 |
| Zeros | 7635 |
| Zeros (%) | 6.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.9082907 |
|---|---|
| Coefficient of variation (CV) | 0.76318308 |
| Kurtosis | 24.307258 |
| Mean | 2.5004363 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.8628027 |
| Sum | 298027 |
| Variance | 3.6415733 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 33618 | |
| 1 | 30255 | |
| 3 | 22222 | |
| 5 | 11062 | 9.3% |
| 4 | 9554 | 8.0% |
| 0 | 7635 | 6.4% |
| 6 | 1498 | 1.3% |
| 10 | 1035 | 0.9% |
| 7 | 1026 | 0.9% |
| 8 | 655 | 0.5% |
| Other values (25) | 630 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 7635 | 6.4% |
| 1 | 30255 | |
| 2 | 33618 | |
| 3 | 22222 | |
| 4 | 9554 | 8.0% |
| 5 | 11062 | 9.3% |
| 6 | 1498 | 1.3% |
| 7 | 1026 | 0.9% |
| 8 | 655 | 0.5% |
| 9 | 229 | 0.2% |
| Value | Count | Frequency (%) |
| 50 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 40 | 2 | < 0.1% |
| 35 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 30 | 5 | |
| 26 | 1 | < 0.1% |
adults
Real number (ℝ)
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.856498 |
| Minimum | 0 |
|---|---|
| Maximum | 55 |
| Zeros | 401 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 55 |
| Range | 55 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.57939475 |
|---|---|
| Coefficient of variation (CV) | 0.31209015 |
| Kurtosis | 1353.1199 |
| Mean | 1.856498 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 18.336632 |
| Sum | 221276 |
| Variance | 0.33569828 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 89530 | |
| 1 | 22985 | 19.3% |
| 3 | 6196 | 5.2% |
| 0 | 401 | 0.3% |
| 4 | 62 | 0.1% |
| 26 | 5 | < 0.1% |
| 27 | 2 | < 0.1% |
| 20 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| 40 | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 401 | 0.3% |
| 1 | 22985 | 19.3% |
| 2 | 89530 | |
| 3 | 6196 | 5.2% |
| 4 | 62 | 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 26 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 55 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 27 | 2 | < 0.1% |
| 26 | 5 | < 0.1% |
| 20 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 62 |
children
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 931.3 KiB |
| 0.0 | |
|---|---|
| 1.0 | 4855 |
| 2.0 | 3640 |
| 3.0 | 76 |
| 10.0 | 1 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.0000084 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 110614 | |
| 1.0 | 4855 | 4.1% |
| 2.0 | 3640 | 3.1% |
| 3.0 | 76 | 0.1% |
| 10.0 | 1 | < 0.1% |
| (Missing) | 4 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 110614 | |
| 1.0 | 4855 | 4.1% |
| 2.0 | 3640 | 3.1% |
| 3.0 | 76 | 0.1% |
| 10.0 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 229801 | |
| . | 119186 | |
| 1 | 4856 | 1.4% |
| 2 | 3640 | 1.0% |
| 3 | 76 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 357559 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 229801 | |
| . | 119186 | |
| 1 | 4856 | 1.4% |
| 2 | 3640 | 1.0% |
| 3 | 76 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 357559 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 229801 | |
| . | 119186 | |
| 1 | 4856 | 1.4% |
| 2 | 3640 | 1.0% |
| 3 | 76 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 357559 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 229801 | |
| . | 119186 | |
| 1 | 4856 | 1.4% |
| 2 | 3640 | 1.0% |
| 3 | 76 | < 0.1% |
babies
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| 0 | |
|---|---|
| 1 | 899 |
| 2 | 15 |
| 9 | 1 |
| 10 | 1 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000084 |
| Min length | 1 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 118274 | |
| 1 | 899 | 0.8% |
| 2 | 15 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 118274 | |
| 1 | 899 | 0.8% |
| 2 | 15 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 118275 | |
| 1 | 900 | 0.8% |
| 2 | 15 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 119191 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 118275 | |
| 1 | 900 | 0.8% |
| 2 | 15 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 119191 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 118275 | |
| 1 | 900 | 0.8% |
| 2 | 15 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 119191 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 118275 | |
| 1 | 900 | 0.8% |
| 2 | 15 | < 0.1% |
| 9 | 1 | < 0.1% |
meal
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| BB | |
|---|---|
| HB | |
| SC | |
| Undefined | 1169 |
| FB | 796 |
Length
| Max length | 9 |
|---|---|
| Median length | 2 |
| Mean length | 2.0686551 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BB |
|---|---|
| 2nd row | SC |
| 3rd row | SC |
| 4th row | BB |
| 5th row | BB |
Common Values
| Value | Count | Frequency (%) |
| BB | 92160 | |
| HB | 14434 | 12.1% |
| SC | 10631 | 8.9% |
| Undefined | 1169 | 1.0% |
| FB | 796 | 0.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| bb | 92160 | |
| hb | 14434 | 12.1% |
| sc | 10631 | 8.9% |
| undefined | 1169 | 1.0% |
| fb | 796 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 199550 | |
| H | 14434 | 5.9% |
| S | 10631 | 4.3% |
| C | 10631 | 4.3% |
| n | 2338 | 0.9% |
| d | 2338 | 0.9% |
| e | 2338 | 0.9% |
| U | 1169 | 0.5% |
| f | 1169 | 0.5% |
| i | 1169 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 246563 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| B | 199550 | |
| H | 14434 | 5.9% |
| S | 10631 | 4.3% |
| C | 10631 | 4.3% |
| n | 2338 | 0.9% |
| d | 2338 | 0.9% |
| e | 2338 | 0.9% |
| U | 1169 | 0.5% |
| f | 1169 | 0.5% |
| i | 1169 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 246563 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| B | 199550 | |
| H | 14434 | 5.9% |
| S | 10631 | 4.3% |
| C | 10631 | 4.3% |
| n | 2338 | 0.9% |
| d | 2338 | 0.9% |
| e | 2338 | 0.9% |
| U | 1169 | 0.5% |
| f | 1169 | 0.5% |
| i | 1169 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 246563 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| B | 199550 | |
| H | 14434 | 5.9% |
| S | 10631 | 4.3% |
| C | 10631 | 4.3% |
| n | 2338 | 0.9% |
| d | 2338 | 0.9% |
| e | 2338 | 0.9% |
| U | 1169 | 0.5% |
| f | 1169 | 0.5% |
| i | 1169 | 0.5% |
country
Text
| Distinct | 177 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 487 |
| Missing (%) | 0.4% |
| Memory size | 931.3 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.9892505 |
| Min length | 2 |
Unique
| Unique | 30 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | BEL |
|---|---|
| 2nd row | DEU |
| 3rd row | ESP |
| 4th row | PRT |
| 5th row | PRT |
| Value | Count | Frequency (%) |
| prt | 48511 | |
| gbr | 12113 | 10.2% |
| fra | 10400 | 8.8% |
| esp | 8549 | 7.2% |
| deu | 7273 | 6.1% |
| ita | 3760 | 3.2% |
| irl | 3372 | 2.8% |
| bel | 2337 | 2.0% |
| bra | 2219 | 1.9% |
| nld | 2102 | 1.8% |
| Other values (167) | 18067 | 15.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 80676 | |
| P | 58405 | |
| T | 54175 | |
| A | 21590 | 6.1% |
| E | 21495 | 6.1% |
| B | 17025 | 4.8% |
| S | 13900 | 3.9% |
| U | 13268 | 3.7% |
| G | 13112 | 3.7% |
| F | 10940 | 3.1% |
| Other values (16) | 50247 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 354833 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| R | 80676 | |
| P | 58405 | |
| T | 54175 | |
| A | 21590 | 6.1% |
| E | 21495 | 6.1% |
| B | 17025 | 4.8% |
| S | 13900 | 3.9% |
| U | 13268 | 3.7% |
| G | 13112 | 3.7% |
| F | 10940 | 3.1% |
| Other values (16) | 50247 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 354833 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| R | 80676 | |
| P | 58405 | |
| T | 54175 | |
| A | 21590 | 6.1% |
| E | 21495 | 6.1% |
| B | 17025 | 4.8% |
| S | 13900 | 3.9% |
| U | 13268 | 3.7% |
| G | 13112 | 3.7% |
| F | 10940 | 3.1% |
| Other values (16) | 50247 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 354833 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| R | 80676 | |
| P | 58405 | |
| T | 54175 | |
| A | 21590 | 6.1% |
| E | 21495 | 6.1% |
| B | 17025 | 4.8% |
| S | 13900 | 3.9% |
| U | 13268 | 3.7% |
| G | 13112 | 3.7% |
| F | 10940 | 3.1% |
| Other values (16) | 50247 |
market_segment
Categorical
High correlation 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| Online TA | |
|---|---|
| Offline TA/TO | |
| Groups | |
| Direct | |
| Corporate | 5292 |
| Other values (3) | 978 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 9.0195906 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Online TA |
|---|---|
| 2nd row | Online TA |
| 3rd row | Online TA |
| 4th row | Direct |
| 5th row | Direct |
Common Values
| Value | Count | Frequency (%) |
| Online TA | 56392 | |
| Offline TA/TO | 24170 | |
| Groups | 19777 | 16.6% |
| Direct | 12581 | 10.6% |
| Corporate | 5292 | 4.4% |
| Complementary | 741 | 0.6% |
| Aviation | 235 | 0.2% |
| Undefined | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| online | 56392 | |
| ta | 56392 | |
| offline | 24170 | |
| ta/to | 24170 | |
| groups | 19777 | 9.9% |
| direct | 12581 | 6.3% |
| corporate | 5292 | 2.6% |
| complementary | 741 | 0.4% |
| aviation | 235 | 0.1% |
| undefined | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 137934 | |
| O | 104732 | |
| T | 104732 | |
| e | 99921 | |
| i | 93615 | |
| l | 81303 | |
| A | 80797 | |
| 80562 | ||
| f | 48342 | 4.5% |
| r | 43683 | 4.1% |
| Other values (16) | 199424 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1075045 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 137934 | |
| O | 104732 | |
| T | 104732 | |
| e | 99921 | |
| i | 93615 | |
| l | 81303 | |
| A | 80797 | |
| 80562 | ||
| f | 48342 | 4.5% |
| r | 43683 | 4.1% |
| Other values (16) | 199424 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1075045 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 137934 | |
| O | 104732 | |
| T | 104732 | |
| e | 99921 | |
| i | 93615 | |
| l | 81303 | |
| A | 80797 | |
| 80562 | ||
| f | 48342 | 4.5% |
| r | 43683 | 4.1% |
| Other values (16) | 199424 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1075045 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 137934 | |
| O | 104732 | |
| T | 104732 | |
| e | 99921 | |
| i | 93615 | |
| l | 81303 | |
| A | 80797 | |
| 80562 | ||
| f | 48342 | 4.5% |
| r | 43683 | 4.1% |
| Other values (16) | 199424 |
distribution_channel
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| TA/TO | |
|---|---|
| Direct | |
| Corporate | 6670 |
| GDS | 193 |
| Undefined | 5 |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.3434013 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TA/TO |
|---|---|
| 2nd row | TA/TO |
| 3rd row | TA/TO |
| 4th row | Direct |
| 5th row | Direct |
Common Values
| Value | Count | Frequency (%) |
| TA/TO | 97706 | |
| Direct | 14616 | 12.3% |
| Corporate | 6670 | 5.6% |
| GDS | 193 | 0.2% |
| Undefined | 5 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ta/to | 97706 | |
| direct | 14616 | 12.3% |
| corporate | 6670 | 5.6% |
| gds | 193 | 0.2% |
| undefined | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 195412 | |
| / | 97706 | |
| O | 97706 | |
| A | 97706 | |
| r | 27956 | 4.4% |
| e | 21296 | 3.3% |
| t | 21286 | 3.3% |
| D | 14809 | 2.3% |
| i | 14621 | 2.3% |
| c | 14616 | 2.3% |
| Other values (10) | 33766 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 636880 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| T | 195412 | |
| / | 97706 | |
| O | 97706 | |
| A | 97706 | |
| r | 27956 | 4.4% |
| e | 21296 | 3.3% |
| t | 21286 | 3.3% |
| D | 14809 | 2.3% |
| i | 14621 | 2.3% |
| c | 14616 | 2.3% |
| Other values (10) | 33766 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 636880 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| T | 195412 | |
| / | 97706 | |
| O | 97706 | |
| A | 97706 | |
| r | 27956 | 4.4% |
| e | 21296 | 3.3% |
| t | 21286 | 3.3% |
| D | 14809 | 2.3% |
| i | 14621 | 2.3% |
| c | 14616 | 2.3% |
| Other values (10) | 33766 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 636880 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| T | 195412 | |
| / | 97706 | |
| O | 97706 | |
| A | 97706 | |
| r | 27956 | 4.4% |
| e | 21296 | 3.3% |
| t | 21286 | 3.3% |
| D | 14809 | 2.3% |
| i | 14621 | 2.3% |
| c | 14616 | 2.3% |
| Other values (10) | 33766 | 5.3% |
is_repeated_guest
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| 0 | |
|---|---|
| 1 | 3807 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 115383 | |
| 1 | 3807 | 3.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 115383 | |
| 1 | 3807 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 115383 | |
| 1 | 3807 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 115383 | |
| 1 | 3807 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 115383 | |
| 1 | 3807 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 115383 | |
| 1 | 3807 | 3.2% |
previous_cancellations
Real number (ℝ)
Skewed  Zeros 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.087205302 |
| Minimum | 0 |
|---|---|
| Maximum | 26 |
| Zeros | 112713 |
| Zeros (%) | 94.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.84500825 |
|---|---|
| Coefficient of variation (CV) | 9.6898724 |
| Kurtosis | 673.04527 |
| Mean | 0.087205302 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.440316 |
| Sum | 10394 |
| Variance | 0.71403895 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 112713 | |
| 1 | 6044 | 5.1% |
| 2 | 116 | 0.1% |
| 3 | 65 | 0.1% |
| 24 | 48 | < 0.1% |
| 11 | 35 | < 0.1% |
| 4 | 31 | < 0.1% |
| 26 | 26 | < 0.1% |
| 25 | 25 | < 0.1% |
| 6 | 22 | < 0.1% |
| Other values (5) | 65 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 112713 | |
| 1 | 6044 | 5.1% |
| 2 | 116 | 0.1% |
| 3 | 65 | 0.1% |
| 4 | 31 | < 0.1% |
| 5 | 19 | < 0.1% |
| 6 | 22 | < 0.1% |
| 11 | 35 | < 0.1% |
| 13 | 12 | < 0.1% |
| 14 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 26 | 26 | |
| 25 | 25 | |
| 24 | 48 | |
| 21 | 1 | < 0.1% |
| 19 | 19 | < 0.1% |
| 14 | 14 | < 0.1% |
| 13 | 12 | < 0.1% |
| 11 | 35 | |
| 6 | 22 | |
| 5 | 19 | < 0.1% |
previous_bookings_not_canceled
Real number (ℝ)
Skewed  Zeros 
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.13717594 |
| Minimum | 0 |
|---|---|
| Maximum | 72 |
| Zeros | 115573 |
| Zeros (%) | 97.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 72 |
| Range | 72 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.4979736 |
|---|---|
| Coefficient of variation (CV) | 10.92009 |
| Kurtosis | 767.32199 |
| Mean | 0.13717594 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.543838 |
| Sum | 16350 |
| Variance | 2.243925 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 115573 | |
| 1 | 1540 | 1.3% |
| 2 | 580 | 0.5% |
| 3 | 333 | 0.3% |
| 4 | 229 | 0.2% |
| 5 | 181 | 0.2% |
| 6 | 115 | 0.1% |
| 7 | 88 | 0.1% |
| 8 | 70 | 0.1% |
| 9 | 60 | 0.1% |
| Other values (63) | 421 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 115573 | |
| 1 | 1540 | 1.3% |
| 2 | 580 | 0.5% |
| 3 | 333 | 0.3% |
| 4 | 229 | 0.2% |
| 5 | 181 | 0.2% |
| 6 | 115 | 0.1% |
| 7 | 88 | 0.1% |
| 8 | 70 | 0.1% |
| 9 | 60 | 0.1% |
| Value | Count | Frequency (%) |
| 72 | 1 | |
| 71 | 1 | |
| 70 | 1 | |
| 69 | 1 | |
| 68 | 1 | |
| 67 | 1 | |
| 66 | 1 | |
| 65 | 1 | |
| 64 | 1 | |
| 63 | 1 |
reserved_room_type
Categorical
High correlation  Imbalance 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| A | |
|---|---|
| D | |
| E | 6521 |
| F | 2891 |
| G | 2091 |
| Other values (5) | 2664 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | D |
| 5th row | D |
Common Values
| Value | Count | Frequency (%) |
| A | 85847 | |
| D | 19176 | 16.1% |
| E | 6521 | 5.5% |
| F | 2891 | 2.4% |
| G | 2091 | 1.8% |
| B | 1116 | 0.9% |
| C | 930 | 0.8% |
| H | 600 | 0.5% |
| P | 12 | < 0.1% |
| L | 6 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 85847 | |
| d | 19176 | 16.1% |
| e | 6521 | 5.5% |
| f | 2891 | 2.4% |
| g | 2091 | 1.8% |
| b | 1116 | 0.9% |
| c | 930 | 0.8% |
| h | 600 | 0.5% |
| p | 12 | < 0.1% |
| l | 6 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 85847 | |
| D | 19176 | 16.1% |
| E | 6521 | 5.5% |
| F | 2891 | 2.4% |
| G | 2091 | 1.8% |
| B | 1116 | 0.9% |
| C | 930 | 0.8% |
| H | 600 | 0.5% |
| P | 12 | < 0.1% |
| L | 6 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 85847 | |
| D | 19176 | 16.1% |
| E | 6521 | 5.5% |
| F | 2891 | 2.4% |
| G | 2091 | 1.8% |
| B | 1116 | 0.9% |
| C | 930 | 0.8% |
| H | 600 | 0.5% |
| P | 12 | < 0.1% |
| L | 6 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 85847 | |
| D | 19176 | 16.1% |
| E | 6521 | 5.5% |
| F | 2891 | 2.4% |
| G | 2091 | 1.8% |
| B | 1116 | 0.9% |
| C | 930 | 0.8% |
| H | 600 | 0.5% |
| P | 12 | < 0.1% |
| L | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 85847 | |
| D | 19176 | 16.1% |
| E | 6521 | 5.5% |
| F | 2891 | 2.4% |
| G | 2091 | 1.8% |
| B | 1116 | 0.9% |
| C | 930 | 0.8% |
| H | 600 | 0.5% |
| P | 12 | < 0.1% |
| L | 6 | < 0.1% |
assigned_room_type
Categorical
High correlation  Imbalance 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| A | |
|---|---|
| D | |
| E | |
| F | 3744 |
| G | 2548 |
| Other values (7) | 5897 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | D |
| 5th row | D |
Common Values
| Value | Count | Frequency (%) |
| A | 73924 | |
| D | 25287 | 21.2% |
| E | 7790 | 6.5% |
| F | 3744 | 3.1% |
| G | 2548 | 2.1% |
| C | 2373 | 2.0% |
| B | 2160 | 1.8% |
| H | 710 | 0.6% |
| I | 362 | 0.3% |
| K | 279 | 0.2% |
| Other values (2) | 13 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| a | 73924 | |
| d | 25287 | 21.2% |
| e | 7790 | 6.5% |
| f | 3744 | 3.1% |
| g | 2548 | 2.1% |
| c | 2373 | 2.0% |
| b | 2160 | 1.8% |
| h | 710 | 0.6% |
| i | 362 | 0.3% |
| k | 279 | 0.2% |
| Other values (2) | 13 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 73924 | |
| D | 25287 | 21.2% |
| E | 7790 | 6.5% |
| F | 3744 | 3.1% |
| G | 2548 | 2.1% |
| C | 2373 | 2.0% |
| B | 2160 | 1.8% |
| H | 710 | 0.6% |
| I | 362 | 0.3% |
| K | 279 | 0.2% |
| Other values (2) | 13 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 73924 | |
| D | 25287 | 21.2% |
| E | 7790 | 6.5% |
| F | 3744 | 3.1% |
| G | 2548 | 2.1% |
| C | 2373 | 2.0% |
| B | 2160 | 1.8% |
| H | 710 | 0.6% |
| I | 362 | 0.3% |
| K | 279 | 0.2% |
| Other values (2) | 13 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 73924 | |
| D | 25287 | 21.2% |
| E | 7790 | 6.5% |
| F | 3744 | 3.1% |
| G | 2548 | 2.1% |
| C | 2373 | 2.0% |
| B | 2160 | 1.8% |
| H | 710 | 0.6% |
| I | 362 | 0.3% |
| K | 279 | 0.2% |
| Other values (2) | 13 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 73924 | |
| D | 25287 | 21.2% |
| E | 7790 | 6.5% |
| F | 3744 | 3.1% |
| G | 2548 | 2.1% |
| C | 2373 | 2.0% |
| B | 2160 | 1.8% |
| H | 710 | 0.6% |
| I | 362 | 0.3% |
| K | 279 | 0.2% |
| Other values (2) | 13 | < 0.1% |
booking_changes
Real number (ℝ)
Zeros 
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22105881 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 101147 |
| Zeros (%) | 84.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.65188303 |
|---|---|
| Coefficient of variation (CV) | 2.9489122 |
| Kurtosis | 79.316679 |
| Mean | 0.22105881 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.9917234 |
| Sum | 26348 |
| Variance | 0.42495148 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 101147 | |
| 1 | 12678 | 10.6% |
| 2 | 3797 | 3.2% |
| 3 | 926 | 0.8% |
| 4 | 376 | 0.3% |
| 5 | 118 | 0.1% |
| 6 | 63 | 0.1% |
| 7 | 31 | < 0.1% |
| 8 | 17 | < 0.1% |
| 9 | 8 | < 0.1% |
| Other values (11) | 29 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 101147 | |
| 1 | 12678 | 10.6% |
| 2 | 3797 | 3.2% |
| 3 | 926 | 0.8% |
| 4 | 376 | 0.3% |
| 5 | 118 | 0.1% |
| 6 | 63 | 0.1% |
| 7 | 31 | < 0.1% |
| 8 | 17 | < 0.1% |
| 9 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 3 | |
| 14 | 5 | |
| 13 | 5 | |
| 12 | 2 | < 0.1% |
| 11 | 2 | < 0.1% |
deposit_type
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| No Deposit | |
|---|---|
| Non Refund | |
| Refundable | 161 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Deposit |
|---|---|
| 2nd row | No Deposit |
| 3rd row | No Deposit |
| 4th row | No Deposit |
| 5th row | No Deposit |
Common Values
| Value | Count | Frequency (%) |
| No Deposit | 104466 | |
| Non Refund | 14563 | 12.2% |
| Refundable | 161 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| no | 104466 | |
| deposit | 104466 | |
| non | 14563 | 6.1% |
| refund | 14563 | 6.1% |
| refundable | 161 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 223495 | |
| e | 119351 | |
| N | 119029 | |
| 119029 | ||
| s | 104466 | |
| i | 104466 | |
| t | 104466 | |
| p | 104466 | |
| D | 104466 | |
| n | 29287 | 2.5% |
| Other values (7) | 59379 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1191900 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 223495 | |
| e | 119351 | |
| N | 119029 | |
| 119029 | ||
| s | 104466 | |
| i | 104466 | |
| t | 104466 | |
| p | 104466 | |
| D | 104466 | |
| n | 29287 | 2.5% |
| Other values (7) | 59379 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1191900 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 223495 | |
| e | 119351 | |
| N | 119029 | |
| 119029 | ||
| s | 104466 | |
| i | 104466 | |
| t | 104466 | |
| p | 104466 | |
| D | 104466 | |
| n | 29287 | 2.5% |
| Other values (7) | 59379 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1191900 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 223495 | |
| e | 119351 | |
| N | 119029 | |
| 119029 | ||
| s | 104466 | |
| i | 104466 | |
| t | 104466 | |
| p | 104466 | |
| D | 104466 | |
| n | 29287 | 2.5% |
| Other values (7) | 59379 | 5.0% |
agent
Real number (ℝ)
High correlation  Missing 
| Distinct | 333 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 16315 |
| Missing (%) | 13.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 86.680389 |
| Minimum | 1 |
|---|---|
| Maximum | 535 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 9 |
| median | 14 |
| Q3 | 229 |
| 95-th percentile | 250 |
| Maximum | 535 |
| Range | 534 |
| Interquartile range (IQR) | 220 |
Descriptive statistics
| Standard deviation | 110.7661 |
|---|---|
| Coefficient of variation (CV) | 1.277868 |
| Kurtosis | -0.0085158428 |
| Mean | 86.680389 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 1.0892704 |
| Sum | 8917245 |
| Variance | 12269.128 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 31915 | |
| 240 | 13901 | |
| 1 | 7182 | 6.0% |
| 14 | 3635 | 3.0% |
| 7 | 3533 | 3.0% |
| 6 | 3283 | 2.8% |
| 250 | 2863 | 2.4% |
| 241 | 1718 | 1.4% |
| 28 | 1663 | 1.4% |
| 8 | 1509 | 1.3% |
| Other values (323) | 31673 | |
| (Missing) | 16315 |
| Value | Count | Frequency (%) |
| 1 | 7182 | 6.0% |
| 2 | 161 | 0.1% |
| 3 | 1333 | 1.1% |
| 4 | 47 | < 0.1% |
| 5 | 330 | 0.3% |
| 6 | 3283 | 2.8% |
| 7 | 3533 | 3.0% |
| 8 | 1509 | 1.3% |
| 9 | 31915 | |
| 10 | 260 | 0.2% |
| Value | Count | Frequency (%) |
| 535 | 3 | < 0.1% |
| 531 | 68 | |
| 527 | 35 | |
| 526 | 8 | < 0.1% |
| 510 | 2 | < 0.1% |
| 509 | 10 | < 0.1% |
| 508 | 6 | < 0.1% |
| 502 | 24 | < 0.1% |
| 497 | 1 | < 0.1% |
| 495 | 57 |
company
Real number (ℝ)
Missing 
| Distinct | 352 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 112402 |
| Missing (%) | 94.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 189.1868 |
| Minimum | 6 |
|---|---|
| Maximum | 543 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 40 |
| Q1 | 62 |
| median | 178 |
| Q3 | 270 |
| 95-th percentile | 435 |
| Maximum | 543 |
| Range | 537 |
| Interquartile range (IQR) | 208 |
Descriptive statistics
| Standard deviation | 131.69555 |
|---|---|
| Coefficient of variation (CV) | 0.69611383 |
| Kurtosis | -0.49024666 |
| Mean | 189.1868 |
| Median Absolute Deviation (MAD) | 110 |
| Skewness | 0.6028196 |
| Sum | 1284200 |
| Variance | 17343.718 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40 | 927 | 0.8% |
| 223 | 782 | 0.7% |
| 67 | 267 | 0.2% |
| 45 | 250 | 0.2% |
| 153 | 213 | 0.2% |
| 174 | 149 | 0.1% |
| 219 | 141 | 0.1% |
| 281 | 137 | 0.1% |
| 154 | 133 | 0.1% |
| 405 | 119 | 0.1% |
| Other values (342) | 3670 | 3.1% |
| (Missing) | 112402 |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 37 | |
| 10 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 12 | 14 | < 0.1% |
| 14 | 9 | < 0.1% |
| 16 | 5 | < 0.1% |
| 18 | 1 | < 0.1% |
| 20 | 50 |
| Value | Count | Frequency (%) |
| 543 | 2 | < 0.1% |
| 541 | 1 | < 0.1% |
| 539 | 2 | < 0.1% |
| 534 | 2 | < 0.1% |
| 531 | 1 | < 0.1% |
| 530 | 5 | < 0.1% |
| 528 | 2 | < 0.1% |
| 525 | 15 | |
| 523 | 19 | |
| 521 | 7 | < 0.1% |
days_in_waiting_list
Real number (ℝ)
Zeros 
| Distinct | 128 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.316243 |
| Minimum | 0 |
|---|---|
| Maximum | 391 |
| Zeros | 115501 |
| Zeros (%) | 96.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 391 |
| Range | 391 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 17.566855 |
|---|---|
| Coefficient of variation (CV) | 7.5842023 |
| Kurtosis | 187.6359 |
| Mean | 2.316243 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.965906 |
| Sum | 276073 |
| Variance | 308.59441 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 115501 | |
| 39 | 226 | 0.2% |
| 58 | 164 | 0.1% |
| 44 | 141 | 0.1% |
| 31 | 127 | 0.1% |
| 35 | 96 | 0.1% |
| 46 | 94 | 0.1% |
| 69 | 89 | 0.1% |
| 63 | 83 | 0.1% |
| 87 | 80 | 0.1% |
| Other values (118) | 2589 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 115501 | |
| 1 | 12 | < 0.1% |
| 2 | 5 | < 0.1% |
| 3 | 59 | < 0.1% |
| 4 | 25 | < 0.1% |
| 5 | 8 | < 0.1% |
| 6 | 16 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 7 | < 0.1% |
| 9 | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 391 | 45 | |
| 379 | 15 | < 0.1% |
| 330 | 15 | < 0.1% |
| 259 | 10 | < 0.1% |
| 236 | 34 | |
| 224 | 10 | < 0.1% |
| 223 | 59 | |
| 215 | 21 | < 0.1% |
| 207 | 15 | < 0.1% |
| 193 | 1 | < 0.1% |
customer_type
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| Transient | |
|---|---|
| Transient-Party | |
| Contract | 4075 |
| Group | 577 |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 10.208566 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Transient |
|---|---|
| 2nd row | Transient |
| 3rd row | Transient |
| 4th row | Transient |
| 5th row | Transient |
Common Values
| Value | Count | Frequency (%) |
| Transient | 89466 | |
| Transient-Party | 25072 | 21.0% |
| Contract | 4075 | 3.4% |
| Group | 577 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| transient | 89466 | |
| transient-party | 25072 | 21.0% |
| contract | 4075 | 3.4% |
| group | 577 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 233151 | |
| t | 147760 | |
| r | 144262 | |
| a | 143685 | |
| T | 114538 | |
| s | 114538 | |
| i | 114538 | |
| e | 114538 | |
| y | 25072 | 2.1% |
| - | 25072 | 2.1% |
| Other values (7) | 39605 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1216759 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 233151 | |
| t | 147760 | |
| r | 144262 | |
| a | 143685 | |
| T | 114538 | |
| s | 114538 | |
| i | 114538 | |
| e | 114538 | |
| y | 25072 | 2.1% |
| - | 25072 | 2.1% |
| Other values (7) | 39605 | 3.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1216759 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 233151 | |
| t | 147760 | |
| r | 144262 | |
| a | 143685 | |
| T | 114538 | |
| s | 114538 | |
| i | 114538 | |
| e | 114538 | |
| y | 25072 | 2.1% |
| - | 25072 | 2.1% |
| Other values (7) | 39605 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1216759 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 233151 | |
| t | 147760 | |
| r | 144262 | |
| a | 143685 | |
| T | 114538 | |
| s | 114538 | |
| i | 114538 | |
| e | 114538 | |
| y | 25072 | 2.1% |
| - | 25072 | 2.1% |
| Other values (7) | 39605 | 3.3% |
adr
Real number (ℝ)
Zeros 
| Distinct | 8871 |
|---|---|
| Distinct (%) | 7.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 101.82772 |
| Minimum | -6.38 |
|---|---|
| Maximum | 5400 |
| Zeros | 1953 |
| Zeros (%) | 1.6% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | -6.38 |
|---|---|
| 5-th percentile | 38.4 |
| Q1 | 69.2 |
| median | 94.5 |
| Q3 | 126 |
| 95-th percentile | 193.5 |
| Maximum | 5400 |
| Range | 5406.38 |
| Interquartile range (IQR) | 56.8 |
Descriptive statistics
| Standard deviation | 50.537121 |
|---|---|
| Coefficient of variation (CV) | 0.49630024 |
| Kurtosis | 1014.7826 |
| Mean | 101.82772 |
| Median Absolute Deviation (MAD) | 27.9 |
| Skewness | 10.545542 |
| Sum | 12136846 |
| Variance | 2554.0006 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 62 | 3752 | 3.1% |
| 75 | 2710 | 2.3% |
| 90 | 2470 | 2.1% |
| 65 | 2414 | 2.0% |
| 0 | 1953 | 1.6% |
| 80 | 1883 | 1.6% |
| 95 | 1658 | 1.4% |
| 120 | 1603 | 1.3% |
| 100 | 1572 | 1.3% |
| 85 | 1537 | 1.3% |
| Other values (8861) | 97638 |
| Value | Count | Frequency (%) |
| -6.38 | 1 | < 0.1% |
| 0 | 1953 | |
| 0.26 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 1 | 15 | < 0.1% |
| 1.29 | 1 | < 0.1% |
| 1.48 | 1 | < 0.1% |
| 1.56 | 2 | < 0.1% |
| 1.6 | 1 | < 0.1% |
| 1.8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5400 | 1 | |
| 510 | 1 | |
| 508 | 1 | |
| 451.5 | 1 | |
| 450 | 1 | |
| 437 | 1 | |
| 426.25 | 1 | |
| 402 | 1 | |
| 397.38 | 1 | |
| 392 | 2 |
required_car_parking_spaces
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| 0 | |
|---|---|
| 1 | 7365 |
| 2 | 27 |
| 3 | 3 |
| 8 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 111793 | |
| 1 | 7365 | 6.2% |
| 2 | 27 | < 0.1% |
| 3 | 3 | < 0.1% |
| 8 | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 111793 | |
| 1 | 7365 | 6.2% |
| 2 | 27 | < 0.1% |
| 3 | 3 | < 0.1% |
| 8 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 111793 | |
| 1 | 7365 | 6.2% |
| 2 | 27 | < 0.1% |
| 3 | 3 | < 0.1% |
| 8 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 111793 | |
| 1 | 7365 | 6.2% |
| 2 | 27 | < 0.1% |
| 3 | 3 | < 0.1% |
| 8 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 111793 | |
| 1 | 7365 | 6.2% |
| 2 | 27 | < 0.1% |
| 3 | 3 | < 0.1% |
| 8 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 119190 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 111793 | |
| 1 | 7365 | 6.2% |
| 2 | 27 | < 0.1% |
| 3 | 3 | < 0.1% |
| 8 | 2 | < 0.1% |
total_of_special_requests
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.57139022 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 70203 |
| Zeros (%) | 58.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 931.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.79287386 |
|---|---|
| Coefficient of variation (CV) | 1.3876224 |
| Kurtosis | 1.4918075 |
| Mean | 0.57139022 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.349148 |
| Sum | 68104 |
| Variance | 0.62864896 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 70203 | |
| 1 | 33163 | |
| 2 | 12950 | 10.9% |
| 3 | 2495 | 2.1% |
| 4 | 339 | 0.3% |
| 5 | 40 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 70203 | |
| 1 | 33163 | |
| 2 | 12950 | 10.9% |
| 3 | 2495 | 2.1% |
| 4 | 339 | 0.3% |
| 5 | 40 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 40 | < 0.1% |
| 4 | 339 | 0.3% |
| 3 | 2495 | 2.1% |
| 2 | 12950 | 10.9% |
| 1 | 33163 | |
| 0 | 70203 |
reservation_status
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| Check-Out | |
|---|---|
| Canceled | |
| No-Show | 1204 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.6194731 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Check-Out |
|---|---|
| 2nd row | Check-Out |
| 3rd row | Check-Out |
| 4th row | Canceled |
| 5th row | Check-Out |
Common Values
| Value | Count | Frequency (%) |
| Check-Out | 75039 | |
| Canceled | 42947 | |
| No-Show | 1204 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| check-out | 75039 | |
| canceled | 42947 | |
| no-show | 1204 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 160933 | |
| C | 117986 | |
| c | 117986 | |
| h | 76243 | |
| - | 76243 | |
| u | 75039 | |
| t | 75039 | |
| O | 75039 | |
| k | 75039 | |
| a | 42947 | 4.2% |
| Other values (7) | 134861 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1027355 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 160933 | |
| C | 117986 | |
| c | 117986 | |
| h | 76243 | |
| - | 76243 | |
| u | 75039 | |
| t | 75039 | |
| O | 75039 | |
| k | 75039 | |
| a | 42947 | 4.2% |
| Other values (7) | 134861 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1027355 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 160933 | |
| C | 117986 | |
| c | 117986 | |
| h | 76243 | |
| - | 76243 | |
| u | 75039 | |
| t | 75039 | |
| O | 75039 | |
| k | 75039 | |
| a | 42947 | 4.2% |
| Other values (7) | 134861 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1027355 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 160933 | |
| C | 117986 | |
| c | 117986 | |
| h | 76243 | |
| - | 76243 | |
| u | 75039 | |
| t | 75039 | |
| O | 75039 | |
| k | 75039 | |
| a | 42947 | 4.2% |
| Other values (7) | 134861 |
| Distinct | 926 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 931.3 KiB |
| Minimum | 2014-10-17 00:00:00 |
|---|---|
| Maximum | 2017-09-14 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Interactions
Correlations
| adr | adults | agent | arrival_date_day_of_month | arrival_date_month | arrival_date_week_number | arrival_date_year | assigned_room_type | babies | booking_changes | children | company | customer_type | days_in_waiting_list | deposit_type | distribution_channel | hotel | is_canceled | is_repeated_guest | lead_time | market_segment | meal | previous_bookings_not_canceled | previous_cancellations | required_car_parking_spaces | reservation_status | reserved_room_type | stays_in_week_nights | stays_in_weekend_nights | total_of_special_requests | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| adr | 1.000 | 0.280 | -0.049 | 0.027 | 0.001 | 0.074 | 0.000 | 0.000 | 0.000 | 0.005 | 0.000 | 0.052 | 0.000 | -0.039 | 0.007 | 0.000 | 0.000 | 0.000 | 0.000 | 0.015 | 0.000 | 0.000 | -0.143 | -0.150 | 0.000 | 0.000 | 0.000 | 0.094 | 0.051 | 0.196 |
| adults | 0.280 | 1.000 | -0.056 | 0.002 | 0.010 | 0.026 | 0.015 | 0.000 | 0.000 | -0.085 | 0.000 | 0.231 | 0.089 | -0.037 | 0.000 | 0.008 | 0.014 | 0.013 | 0.000 | 0.192 | 0.008 | 0.000 | -0.210 | -0.036 | 0.000 | 0.008 | 0.003 | 0.153 | 0.127 | 0.162 |
| agent | -0.049 | -0.056 | 1.000 | 0.005 | 0.083 | -0.057 | 0.091 | 0.133 | 0.026 | 0.091 | 0.058 | 0.226 | 0.125 | -0.019 | 0.119 | 0.209 | 0.817 | 0.086 | 0.076 | -0.124 | 0.222 | 0.185 | 0.060 | -0.168 | 0.131 | 0.064 | 0.143 | 0.171 | 0.131 | 0.016 |
| arrival_date_day_of_month | 0.027 | 0.002 | 0.005 | 1.000 | 0.058 | 0.061 | 0.044 | 0.009 | 0.005 | 0.013 | 0.010 | 0.045 | 0.032 | 0.032 | 0.054 | 0.028 | 0.026 | 0.021 | 0.018 | 0.008 | 0.033 | 0.039 | -0.002 | -0.012 | 0.007 | 0.023 | 0.010 | -0.016 | -0.007 | 0.003 |
| arrival_date_month | 0.001 | 0.010 | 0.083 | 0.058 | 1.000 | 0.801 | 0.429 | 0.027 | 0.016 | 0.010 | 0.069 | 0.217 | 0.103 | 0.060 | 0.101 | 0.068 | 0.070 | 0.069 | 0.075 | 0.132 | 0.088 | 0.089 | 0.017 | 0.032 | 0.018 | 0.065 | 0.045 | 0.037 | 0.046 | 0.053 |
| arrival_date_week_number | 0.074 | 0.026 | -0.057 | 0.061 | 0.801 | 1.000 | 0.424 | 0.028 | 0.014 | 0.008 | 0.062 | -0.058 | 0.106 | -0.004 | 0.095 | 0.064 | 0.067 | 0.066 | 0.076 | 0.112 | 0.081 | 0.080 | -0.043 | 0.087 | 0.017 | 0.061 | 0.042 | 0.026 | 0.026 | 0.019 |
| arrival_date_year | 0.000 | 0.015 | 0.091 | 0.044 | 0.429 | 0.424 | 1.000 | 0.053 | 0.009 | 0.016 | 0.044 | 0.281 | 0.214 | 0.074 | 0.052 | 0.027 | 0.043 | 0.026 | 0.010 | 0.104 | 0.159 | 0.112 | 0.025 | 0.052 | 0.018 | 0.023 | 0.082 | 0.014 | 0.029 | 0.091 |
| assigned_room_type | 0.000 | 0.000 | 0.133 | 0.009 | 0.027 | 0.028 | 0.053 | 1.000 | 0.044 | 0.052 | 0.304 | 0.085 | 0.090 | 0.029 | 0.192 | 0.095 | 0.391 | 0.203 | 0.071 | 0.062 | 0.121 | 0.116 | 0.003 | 0.008 | 0.092 | 0.145 | 0.776 | 0.047 | 0.051 | 0.066 |
| babies | 0.000 | 0.000 | 0.026 | 0.005 | 0.016 | 0.014 | 0.009 | 0.044 | 1.000 | 0.017 | 0.025 | 0.032 | 0.015 | 0.000 | 0.023 | 0.029 | 0.049 | 0.034 | 0.007 | 0.007 | 0.034 | 0.016 | 0.000 | 0.000 | 0.020 | 0.024 | 0.040 | 0.000 | 0.010 | 0.060 |
| booking_changes | 0.005 | -0.085 | 0.091 | 0.013 | 0.010 | 0.008 | 0.016 | 0.052 | 0.017 | 1.000 | 0.017 | 0.176 | 0.028 | -0.019 | 0.029 | 0.027 | 0.040 | 0.048 | 0.000 | -0.008 | 0.020 | 0.010 | 0.031 | -0.073 | 0.017 | 0.034 | 0.014 | 0.065 | 0.040 | 0.042 |
| children | 0.000 | 0.000 | 0.058 | 0.010 | 0.069 | 0.062 | 0.044 | 0.304 | 0.025 | 0.017 | 1.000 | 0.039 | 0.061 | 0.018 | 0.073 | 0.043 | 0.046 | 0.028 | 0.035 | 0.028 | 0.100 | 0.037 | 0.002 | 0.000 | 0.030 | 0.028 | 0.357 | 0.013 | 0.028 | 0.061 |
| company | 0.052 | 0.231 | 0.226 | 0.045 | 0.217 | -0.058 | 0.281 | 0.085 | 0.032 | 0.176 | 0.039 | 1.000 | 0.251 | 0.021 | 0.183 | 0.218 | 0.498 | 0.141 | 0.358 | 0.286 | 0.392 | 0.200 | -0.298 | -0.198 | 0.047 | 0.106 | 0.098 | 0.250 | 0.076 | -0.128 |
| customer_type | 0.000 | 0.089 | 0.125 | 0.032 | 0.103 | 0.106 | 0.214 | 0.090 | 0.015 | 0.028 | 0.061 | 0.251 | 1.000 | 0.078 | 0.098 | 0.079 | 0.052 | 0.136 | 0.105 | 0.122 | 0.276 | 0.139 | 0.014 | 0.010 | 0.041 | 0.097 | 0.109 | 0.080 | 0.088 | 0.097 |
| days_in_waiting_list | -0.039 | -0.037 | -0.019 | 0.032 | 0.060 | -0.004 | 0.074 | 0.029 | 0.000 | -0.019 | 0.018 | 0.021 | 0.078 | 1.000 | 0.127 | 0.027 | 0.087 | 0.067 | 0.024 | 0.153 | 0.078 | 0.061 | -0.019 | 0.116 | 0.034 | 0.050 | 0.028 | 0.012 | -0.075 | -0.123 |
| deposit_type | 0.007 | 0.000 | 0.119 | 0.054 | 0.101 | 0.095 | 0.052 | 0.192 | 0.023 | 0.029 | 0.073 | 0.183 | 0.098 | 0.127 | 1.000 | 0.091 | 0.176 | 0.481 | 0.058 | 0.274 | 0.374 | 0.093 | 0.013 | 0.051 | 0.071 | 0.347 | 0.152 | 0.047 | 0.073 | 0.220 |
| distribution_channel | 0.000 | 0.008 | 0.209 | 0.028 | 0.068 | 0.064 | 0.027 | 0.095 | 0.029 | 0.027 | 0.043 | 0.218 | 0.079 | 0.027 | 0.091 | 1.000 | 0.187 | 0.177 | 0.297 | 0.116 | 0.692 | 0.077 | 0.108 | 0.051 | 0.076 | 0.129 | 0.100 | 0.006 | 0.055 | 0.070 |
| hotel | 0.000 | 0.014 | 0.817 | 0.026 | 0.070 | 0.067 | 0.043 | 0.391 | 0.049 | 0.040 | 0.046 | 0.498 | 0.052 | 0.087 | 0.176 | 0.187 | 1.000 | 0.136 | 0.050 | 0.094 | 0.147 | 0.317 | 0.017 | 0.050 | 0.220 | 0.136 | 0.323 | 0.192 | 0.198 | 0.046 |
| is_canceled | 0.000 | 0.013 | 0.086 | 0.021 | 0.069 | 0.066 | 0.026 | 0.203 | 0.034 | 0.048 | 0.028 | 0.141 | 0.136 | 0.067 | 0.481 | 0.177 | 0.136 | 1.000 | 0.085 | 0.281 | 0.267 | 0.050 | 0.041 | 0.044 | 0.197 | 1.000 | 0.073 | 0.028 | 0.022 | 0.265 |
| is_repeated_guest | 0.000 | 0.000 | 0.076 | 0.018 | 0.075 | 0.076 | 0.010 | 0.071 | 0.007 | 0.000 | 0.035 | 0.358 | 0.105 | 0.024 | 0.058 | 0.297 | 0.050 | 0.085 | 1.000 | 0.134 | 0.347 | 0.061 | 0.320 | 0.185 | 0.078 | 0.086 | 0.037 | 0.017 | 0.082 | 0.040 |
| lead_time | 0.015 | 0.192 | -0.124 | 0.008 | 0.132 | 0.112 | 0.104 | 0.062 | 0.007 | -0.008 | 0.028 | 0.286 | 0.122 | 0.153 | 0.274 | 0.116 | 0.094 | 0.281 | 0.134 | 1.000 | 0.170 | 0.089 | -0.189 | 0.171 | 0.057 | 0.207 | 0.048 | 0.296 | 0.162 | -0.074 |
| market_segment | 0.000 | 0.008 | 0.222 | 0.033 | 0.088 | 0.081 | 0.159 | 0.121 | 0.034 | 0.020 | 0.100 | 0.392 | 0.276 | 0.078 | 0.374 | 0.692 | 0.147 | 0.267 | 0.347 | 0.170 | 1.000 | 0.191 | 0.097 | 0.054 | 0.092 | 0.195 | 0.138 | 0.033 | 0.061 | 0.210 |
| meal | 0.000 | 0.000 | 0.185 | 0.039 | 0.089 | 0.080 | 0.112 | 0.116 | 0.016 | 0.010 | 0.037 | 0.200 | 0.139 | 0.061 | 0.093 | 0.077 | 0.317 | 0.050 | 0.061 | 0.089 | 0.191 | 1.000 | 0.014 | 0.088 | 0.026 | 0.040 | 0.103 | 0.045 | 0.061 | 0.062 |
| previous_bookings_not_canceled | -0.143 | -0.210 | 0.060 | -0.002 | 0.017 | -0.043 | 0.025 | 0.003 | 0.000 | 0.031 | 0.002 | -0.298 | 0.014 | -0.019 | 0.013 | 0.108 | 0.017 | 0.041 | 0.320 | -0.189 | 0.097 | 0.014 | 1.000 | 0.102 | 0.019 | 0.029 | 0.003 | -0.119 | -0.084 | 0.025 |
| previous_cancellations | -0.150 | -0.036 | -0.168 | -0.012 | 0.032 | 0.087 | 0.052 | 0.008 | 0.000 | -0.073 | 0.000 | -0.198 | 0.010 | 0.116 | 0.051 | 0.051 | 0.050 | 0.044 | 0.185 | 0.171 | 0.054 | 0.088 | 0.102 | 1.000 | 0.000 | 0.031 | 0.006 | -0.062 | -0.055 | -0.130 |
| required_car_parking_spaces | 0.000 | 0.000 | 0.131 | 0.007 | 0.018 | 0.017 | 0.018 | 0.092 | 0.020 | 0.017 | 0.030 | 0.047 | 0.041 | 0.034 | 0.071 | 0.076 | 0.220 | 0.197 | 0.078 | 0.057 | 0.092 | 0.026 | 0.019 | 0.000 | 1.000 | 0.139 | 0.079 | 0.017 | 0.015 | 0.044 |
| reservation_status | 0.000 | 0.008 | 0.064 | 0.023 | 0.065 | 0.061 | 0.023 | 0.145 | 0.024 | 0.034 | 0.028 | 0.106 | 0.097 | 0.050 | 0.347 | 0.129 | 0.136 | 1.000 | 0.086 | 0.207 | 0.195 | 0.040 | 0.029 | 0.031 | 0.139 | 1.000 | 0.052 | 0.030 | 0.024 | 0.189 |
| reserved_room_type | 0.000 | 0.003 | 0.143 | 0.010 | 0.045 | 0.042 | 0.082 | 0.776 | 0.040 | 0.014 | 0.357 | 0.098 | 0.109 | 0.028 | 0.152 | 0.100 | 0.323 | 0.073 | 0.037 | 0.048 | 0.138 | 0.103 | 0.003 | 0.006 | 0.079 | 0.052 | 1.000 | 0.044 | 0.054 | 0.075 |
| stays_in_week_nights | 0.094 | 0.153 | 0.171 | -0.016 | 0.037 | 0.026 | 0.014 | 0.047 | 0.000 | 0.065 | 0.013 | 0.250 | 0.080 | 0.012 | 0.047 | 0.006 | 0.192 | 0.028 | 0.017 | 0.296 | 0.033 | 0.045 | -0.119 | -0.062 | 0.017 | 0.030 | 0.044 | 1.000 | 0.238 | 0.076 |
| stays_in_weekend_nights | 0.051 | 0.127 | 0.131 | -0.007 | 0.046 | 0.026 | 0.029 | 0.051 | 0.010 | 0.040 | 0.028 | 0.076 | 0.088 | -0.075 | 0.073 | 0.055 | 0.198 | 0.022 | 0.082 | 0.162 | 0.061 | 0.061 | -0.084 | -0.055 | 0.015 | 0.024 | 0.054 | 0.238 | 1.000 | 0.080 |
| total_of_special_requests | 0.196 | 0.162 | 0.016 | 0.003 | 0.053 | 0.019 | 0.091 | 0.066 | 0.060 | 0.042 | 0.061 | -0.128 | 0.097 | -0.123 | 0.220 | 0.070 | 0.046 | 0.265 | 0.040 | -0.074 | 0.210 | 0.062 | 0.025 | -0.130 | 0.044 | 0.189 | 0.075 | 0.076 | 0.080 | 1.000 |
Missing values
Sample
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | City Hotel | 0 | 21 | 2015 | September | 36 | 1 | 0 | 4 | 2 | 0.0 | 0 | BB | BEL | Online TA | TA/TO | 0 | 0 | 0 | A | A | 2 | No Deposit | 9.0 | NaN | 0 | Transient | 105.0 | 0 | 0 | Check-Out | 2015-09-05 |
| 1 | City Hotel | 0 | 20 | 2016 | September | 38 | 12 | 1 | 0 | 1 | 0.0 | 0 | SC | DEU | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 89.0 | 0 | 2 | Check-Out | 2016-09-13 |
| 2 | City Hotel | 0 | 2 | 2016 | March | 13 | 24 | 0 | 2 | 2 | 0.0 | 0 | SC | ESP | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 134.0 | 0 | 1 | Check-Out | 2016-03-26 |
| 3 | Resort Hotel | 1 | 6 | 2016 | April | 17 | 21 | 0 | 1 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | D | D | 0 | No Deposit | NaN | NaN | 0 | Transient | 73.0 | 0 | 0 | Canceled | 2016-04-18 |
| 4 | Resort Hotel | 0 | 40 | 2015 | August | 34 | 20 | 2 | 3 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | D | D | 0 | No Deposit | 250.0 | NaN | 0 | Transient | 176.8 | 1 | 1 | Check-Out | 2015-08-25 |
| 5 | City Hotel | 0 | 256 | 2017 | July | 29 | 21 | 1 | 2 | 2 | 0.0 | 0 | BB | DEU | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient-Party | 107.1 | 0 | 2 | Check-Out | 2017-07-24 |
| 6 | City Hotel | 1 | 77 | 2015 | July | 29 | 13 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 76.5 | 0 | 1 | Canceled | 2015-06-29 |
| 7 | City Hotel | 0 | 1 | 2016 | August | 32 | 4 | 0 | 1 | 2 | 0.0 | 0 | BB | BEL | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 151.0 | 0 | 1 | Check-Out | 2016-08-05 |
| 8 | City Hotel | 0 | 150 | 2017 | April | 14 | 2 | 2 | 2 | 2 | 1.0 | 0 | BB | FRA | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 135.0 | 0 | 2 | Check-Out | 2017-04-06 |
| 9 | Resort Hotel | 0 | 90 | 2017 | June | 26 | 28 | 2 | 5 | 2 | 0.0 | 0 | BB | IRL | Direct | Direct | 0 | 0 | 0 | A | A | 0 | No Deposit | NaN | NaN | 0 | Transient | 127.0 | 0 | 0 | Check-Out | 2017-07-05 |
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 119180 | Resort Hotel | 1 | 23 | 2016 | June | 25 | 18 | 0 | 1 | 2 | 0.0 | 0 | HB | ESP | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | NaN | 0 | Transient | 161.0 | 0 | 0 | Canceled | 2016-06-06 |
| 119181 | City Hotel | 0 | 4 | 2016 | January | 5 | 28 | 0 | 2 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | E | E | 0 | No Deposit | 14.0 | NaN | 0 | Transient | 127.0 | 0 | 0 | Check-Out | 2016-01-30 |
| 119182 | City Hotel | 1 | 286 | 2016 | October | 43 | 16 | 1 | 0 | 2 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 44.0 | NaN | 0 | Transient | 90.0 | 0 | 0 | Canceled | 2016-06-20 |
| 119183 | City Hotel | 0 | 193 | 2016 | September | 40 | 30 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | Corporate | 0 | 0 | 0 | A | A | 0 | No Deposit | NaN | NaN | 0 | Transient-Party | 132.0 | 0 | 0 | Check-Out | 2016-10-03 |
| 119184 | City Hotel | 1 | 20 | 2016 | November | 45 | 4 | 0 | 1 | 2 | 0.0 | 0 | SC | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | D | 0 | No Deposit | 159.0 | NaN | 0 | Transient | 100.0 | 0 | 0 | No-Show | 2016-11-04 |
| 119185 | Resort Hotel | 0 | 1 | 2016 | June | 26 | 21 | 0 | 1 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 104.0 | NaN | 0 | Transient | 79.0 | 0 | 0 | Check-Out | 2016-06-22 |
| 119186 | Resort Hotel | 0 | 17 | 2016 | October | 45 | 30 | 1 | 0 | 2 | 0.0 | 0 | BB | PRT | Groups | Direct | 0 | 0 | 0 | A | A | 0 | No Deposit | NaN | 346.0 | 0 | Transient-Party | 66.0 | 1 | 0 | Check-Out | 2016-10-31 |
| 119187 | City Hotel | 0 | 1 | 2017 | April | 17 | 27 | 0 | 1 | 2 | 0.0 | 0 | SC | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 160.0 | 0 | 0 | Check-Out | 2017-04-28 |
| 119188 | Resort Hotel | 0 | 10 | 2017 | June | 25 | 24 | 2 | 1 | 3 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | A | A | 1 | No Deposit | NaN | NaN | 0 | Transient | 185.0 | 1 | 0 | Check-Out | 2017-06-27 |
| 119189 | Resort Hotel | 0 | 56 | 2015 | November | 48 | 23 | 0 | 0 | 2 | 0.0 | 0 | BB | PRT | Online TA | TA/TO | 0 | 0 | 0 | E | A | 0 | No Deposit | 240.0 | NaN | 0 | Transient | 0.0 | 0 | 1 | Check-Out | 2015-11-23 |
Duplicate rows
Most frequently occurring
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5398 | City Hotel | 1 | 277 | 2016 | November | 46 | 7 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | NaN | NaN | 0 | Transient | 100.0 | 0 | 0 | Canceled | 2016-04-04 | 180 |
| 4175 | City Hotel | 1 | 68 | 2016 | February | 8 | 17 | 0 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 1 | 0 | A | A | 0 | Non Refund | 37.0 | NaN | 0 | Transient | 75.0 | 0 | 0 | Canceled | 2016-01-06 | 150 |
| 5068 | City Hotel | 1 | 188 | 2016 | June | 25 | 15 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 119.0 | NaN | 39 | Transient | 130.0 | 0 | 0 | Canceled | 2016-01-18 | 108 |
| 4872 | City Hotel | 1 | 158 | 2016 | May | 22 | 24 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 37.0 | NaN | 31 | Transient | 130.0 | 0 | 0 | Canceled | 2016-01-18 | 101 |
| 3786 | City Hotel | 1 | 28 | 2017 | March | 9 | 2 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | NaN | NaN | 0 | Transient | 95.0 | 0 | 0 | Canceled | 2017-02-02 | 99 |
| 3844 | City Hotel | 1 | 34 | 2015 | December | 50 | 8 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 1 | 0 | A | A | 0 | Non Refund | 19.0 | NaN | 0 | Transient | 90.0 | 0 | 0 | Canceled | 2015-11-17 | 99 |
| 3900 | City Hotel | 1 | 38 | 2017 | January | 2 | 14 | 0 | 1 | 1 | 0.0 | 0 | BB | PRT | Corporate | Corporate | 0 | 0 | 0 | A | A | 0 | Non Refund | NaN | 67.0 | 0 | Transient | 75.0 | 0 | 0 | Canceled | 2016-12-07 | 99 |
| 4865 | City Hotel | 1 | 156 | 2017 | April | 17 | 26 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 37.0 | NaN | 0 | Transient | 100.0 | 0 | 0 | Canceled | 2016-11-21 | 99 |
| 4199 | City Hotel | 1 | 71 | 2016 | June | 25 | 14 | 0 | 3 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 236.0 | NaN | 0 | Transient | 120.0 | 0 | 0 | Canceled | 2016-04-27 | 88 |
| 4932 | City Hotel | 1 | 166 | 2016 | November | 45 | 1 | 0 | 3 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 236.0 | NaN | 0 | Transient | 110.0 | 0 | 0 | Canceled | 2016-07-13 | 85 |